SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Celeste, Anthony J. 

Wozney, John 
Rosen, Vicki A. 
Wolfman, Neil. 
Thomsen, Gerald H. 
Melton, Douglas A. 

(ii) TITLE OF INVENTION : TENDON- INDUCING COMPOSITIONS 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: GENETICS INSTITUTE, INC. 

(B) STREET: 87 ^CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.2 5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lazar, Steven R. 

(B) REGISTRATION NUMBER: 32,618 

(C) REFERENCE /DOCKET NUMBER: 5202 -D 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617 498-8260 

(B) TELEFAX: 617 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: vl-1 

(ix) FEATURE: 

(A) NAME /KEY : mat__peptide 

(B) LOCATION: 571 .. 882 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
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(B) LOCATION: 1..882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GCG CGT AAT ACG ACT CAC TAT AGG GCG AAT TGG GTA CGG GGC CCA GGC 4 8 

Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 
-190 -185 -180 -175 

AGC TGG ACT TCT CCG CCG TTG CTG CTG CTG TCC ACG TGC CCG GGC GCC 96 
Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 
-170 -165 -16.0 

GCC CGA GCG CCA CGC CTG CTG TAC TCG CGG GCA GCT GAG CCC CTA GTC 144 
Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 
-155 -150 -145 

GGT CAG CGC TGG GAG GCG TTC GAC GTG GCG GAC GCC ATG AGG CGC CAC 192 
Gly Gin Arg Trp Glu Ala She Asp Val Ala Asp Ala Met Arg Arg His 
-140 -135 -130 

CGT CGT GAA CCG CGC CCC CCC CGC GCG TTC TGC CTC TTG CTG CGC GCA 24 0 

Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 
-125 -120 -115 

GTG GCA GGC CCG GTG CCG AGC CCG TTG GCA CTG CGG CGA CTG GGC TTC 2 88 

Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 
-110 -105 -100 -95 

GGC TGG CCG GGC GGA^GGG GGC TCT GCG GCA GAG GAG CGC GCG GTG CTA 336 
Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 
-90 -85 -80 

GTC GTC TCC TCC CGC ACG CAG AGG AAA GAG AGC TTA TTC CGG GAG ATC 3 84 

Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie 
-75 -70 -65 

CGC GCC CAG GCC CGC GCG CTC GGG GCC GCT CTG GCC TCA GAG CCG CTG 43 2 

Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 
-60 -55 -50 

CCC GAC CCA GGA ACC GGC ACC GCG TCG CCA AGG GCA GTC ATT GGC GGC 480 
Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly 
-45 -40 -35 

CGC AGA CGG AGG AGG ACG GCG TTG GCC GGG ACG CGG ACA GCG CAG GGC 52 8 

Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 
-30 -25 -20 -15 

AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC AGG GGC CGG AGC CGC 576 
Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 
-10 -5 1 

TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG GAG CTC GGC TGG GAC 6 24 

Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 
5 10 15 

GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG TAC CAC TGC GAG GGC 6 72 

Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly 
20 25 30 

CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC AAC CAT GCC 72 0 

Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 
35 40 45 50 
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ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA GAC GCG GCG CCG GCC 768 
lie lie Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 
55 60 65 

TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC AGC ATC CTC TAC ATC 816 
Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser lie Leu Tyr lie 
70 75 80 

GAC GCC GCC AAC AAC GTT GTC TAC AA<? £AA ; .TAC .GAG GAC ATG GTG GTG 864 
Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 
85 90 95 

GAG GCC TGC GGC TGC AGG . TAGCGCGCGG GCCGGGGAGG GGGCAGCCAC 912 
Glu Ala Cys Gly Cys Arg 
100 

GCGGCCGAGG ATCC 926 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 
-190 -185 -180 ' -175 

Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 
-170 -165 -160 

Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 
-155 -150 -145 

Gly Gin Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His 
-140 -135 -130 

Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 
-125 -120 -115 

Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 
-110 -105 -100 -95 

Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 
-90 -85 -80 

Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie 
-75 -70 -65 

Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 
-60 -55 -50 

Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly 
-45 -40 -35 

Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 
-30 ~ .25 -20 -15 

Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 
-10 -5 l 
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Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 
5 10 15 

Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr- His Cys Glu Glv 
20 25 30 

Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 
35 40 45 50 

He He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 
55 60 65 

Ser Cys Cys Val Pro Ala 'Arg Leu Ser Pro He Ser He Leu Tyr He 
7 ° 75 80 

Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 
85 90 95 

Glu Ala Cys Gly Cys Arg . 
100 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: MP52 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 845.. 1204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACCGGGCGGC CCTGAACCCA AGCCAGGACA CCCTCCCCAA ACAAGGCAGG CTACAGCCCG 
GACTGTGACC CCAAAAGGAC AGCTTCCCGG AGGCAAGGCA CCCCCAAAAG CAGGATCTGT 
CCCCAGCTCC TTCCTGCTGA AGAAGGCCAG GGAGCCCGGG CCCCCACGAG AGCCCAAGGA 
GCCGTTTCGC CCACCCCCCA TCACACCCCA CGAGTACATG CTCTCGCTGT ACAGGACGCT 
GTCCGATGCT GACAGAAAGG GAGGCAACAG CAGCGTGAAG TTGGAGGCTG GCCTGGCCAA 
CACCATCACC AGCTTTATTG ACAAAGGGCA AGATGACCGA GGTCCCGTGG TCAGGAAGCA 
GAGGTACGTG TTTGACATTA GTGCCCTGGA GAAGGATGGG CTGCTGGGGG CCGAGCTCCG 
GATCTTGCGG AAGAAGCCCT CGGACACGGC CAAGCCAGCG GCCCCCGGAG GCGGGCGGGC 
TGCCCAGCTG AAGCTGTCCA GCTGCCCCAG CGGCCGGCAG CCGGCCTCCT TGCTGGATGT 
GCGCTCCGTG CCAGGCCTGG ACGGATCTGG CTGGGAGGTG TTCGACATCT GGAAGCTCTT 
CCGAAACTTT AAGAACTCGG CCCAGCTGTG CCTGGAGCTG GAGGCCTGGG AACGGGGCAG 
GGCCG7GGAC CTCCGTGGCC TGGGCTTCGA CCGCGCCGCC CGGCAGGTCC ACGAGAAGGC 
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CCTGTTCCTG GTGTTTGGCC GCACCAAGAA ACGGGACCTG TTCTTTAATG AGATTAAGGC 780 

CCGCTCTGGC CAGGACGATA AGACCGTGTA TGAGTACCTG TTCAGCCAGC GGCGAAAACG 84 0 

GCGG GCC CCA CTG GCC ACT CGC CAG GGC AAG CGA CCC AGC AAG AAC CTT 889 
Ala Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu 
1 5 10 * 15 

AAG GCT CGC TGC AGT CGG AAG GCA CTG * CAT GTC AAC TTC AAG GAC ATG 937 
Lys Ala Arg Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met 
20 25 30 

GGC TGG GAC GAC TGG ATC" "ATC GCA CCC CTT GAG TAC GAG GCT TTC CAC 985 
Gly Trp Asp Asp Trp lie lie Ala Pro. Leu Glu Tyr Glu Ala Phe His 
35 40 45 

TGC GAG GGG CTG TGC GAG TTC CCA TTG CGC TCC CAC CTG GAG CCC ACG 103 3 

Cys Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
50 55 60 

AAT CAT GCA GTC ATC CAG ACC CTG ATG AAC TCC ATG GAC CCC GAG TCC 1081 
Asn His Ala Val lie Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser 
65 70 75 

ACA CCA CCC ACC TGC TGT GTG CCC ACG CGG CTG AGT CCC ATC AGC ATC 1129 
Thr Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser Pro lie Ser lie 
80 85 90 95 

CTC TTC ATT GAC TCTU-J3CC AAC AAC GTG GTG TAT AAG CAG TAT GAG GAC 1177 
Leu Phe lie Asp Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp 
100 105 110 

ATG GTC GTG GAG TCG TGT GGC TGC AGG TAG 12 07 

Met Val Val Glu Ser Cys Gly Cys Arg 
115 * 120 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ala Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu Lys 
1 5 ~ 10 15 

Ala Arg Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly 
20 25 30 

Trp Asp Asp Trp lie He Ala Pro Leu Glu Tyr Glu Ala Phe His Cys 
35 40 45 

Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn 
50 55 60 

His Ala Val He Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr 
65 70 75 80 

Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser Pro He Ser He Leu 
85 90 95 



53 



Phe lie Asp Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met 
100 105 110 

Val Val Glu Ser Cys Gly Cys Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : -linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: *iomo Sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VI -l fragment 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 28.. 102 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCTGGA AGGATTGGAT CATTGCG CCG CTG GAC TAC GAG GCG TAC CAC 51 

Pro Leu Asp Tyr Glu Ala Tyr His 
1 5 

TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC 99 
Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
10 15 20 

AAC CACGCTATAG TCCAAACCTT TCTAGA top 
Asn 128 
25 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Leu Cys Asp Phe Pro 
1 5 10 15 

Leu Arg Ser His Leu Glu Pro Thr Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinole 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VL-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 28.. 102 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGATCCTGGG ATGACTGGAT TATGGCG CCG CTG GAC TAC GAG GCG TAC CAC 51 

Pro Leu Asp Tyr Glu Ala Tyr His 
1 5 

TGC GAG GGT GTA TGC GAC TTC CCG CTG CGC TCG CAC CTG GAG CCC ACC 99 
Cys Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
10 15 20 

AAC CACGCCATGC TACAAACGCT TCTAGA 128 
Asn 
25 



(2) INFORMATION FOR^gEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION.: SEQ ID NO : 8 : 

Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
1 5 10 15 

Leu Arg Ser His Leu Glu Pro Thr Asn 

20 25 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE : 

(B) CLONE: pALVl-781 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CTAACTACCC AACTCAAAAA AAAAAAAAAA AAAAACCCCC TCTAACCCCC ATTGACGAAA 6 0 

GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC 12 0 

GTC.-.GGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT 18 0 
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ACATTCAAAT ATGTATC CGC TCATGAGACA ATAACCCTGA 
AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC 
ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG 
TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC 
GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT 
CGCGGTATTA TCCCGTATTG ACGCCGGGCA AGAGCAACTC 
TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG 
AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT 
TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT 
TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA 
TGACACCACG ATGCCTGTAG CAATGGCAAC AACGTTGCGC 
ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG 
ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT 
TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA 
CGTAGTTATC TACACGACCJG- GGAGTCAGGC AACTATGGAT 
TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA 
ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG 
TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG 
CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT 
GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG 
TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA 
GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA 
GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG 
CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC 
ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA 
AGAAAGCGCC ACGCTTGCCG AAGGGAGAAA GGCGGACAGG 
CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC 
TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG 
GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG 
TTTTGCTCAC ATGTTCTTTC CTGCGTTATC CCCTGATTCT 
CTTTGAGTGA GCTGATACCG CTCGCCGCAG CCGAACGACC 
CGAGGAAGCG GAAGAGCGCC CAATACGCAA ACCGCCTCTC 
TTAATG C AG A ATTGATCTCT CACCTACCAA ACAATGCCCC 
ATAAAAAACA TACAGATAAC CATCTGCGGT GATAAATTAT 

56 




TAAATGCTTC 


AATAATATTG 


240 


CTTATTCCCT 


TTTTTGCGGC 


300 


AAAGTAAAAG 


ATGCTGAAGA 


360 


AACAGCGGTA 


AGATCCTTGA 


420 


• TTTAAAGTTC 


TGCTATGTGG 


480 


GGTCGCCGCA 


TACACTATTC 


540 


CATCTTACGG 


ATGGCATGAC 


600 


AACACTGCGG 


CCAACTTACT 


660 


TTGCACAACA 


TGGGGGATCA 


720 


GCCATACCAA ACGACGAGCG 


780 


AAACTATTAA 


CTGGCGAACT 


840 


GAGGCGGATA AAGTTGCAGG 


900 


GCTGATAAAT 


CTGGAGCCGG 


960 


GATGGTAAGC CCTCCCGTAT 


1020 


GAACGAAATA 


GACAGATCGC 


1080 


GACCAAGTTT 


ACTCATATAT 


114 0 


ATCTAGGTGA AGATCCTTTT 


1200 


TTCCACTGAG 


CGTCAGACCC 


1260 


CTGCGCGTAA 


TCTGCTGCTT 


1320 


CCGGATCAAG 


AGCTACCAAC 


1380 


CCAAATACTG 


TCCTTCTAGT 


1440 


CCGCCTACAT 


ACCTCGCTCT 


1500 


TCGTGTCTTA 


CCGGGTTGGA 


15 6*0 


TGAACGGGGG 


GTTCGTGCAC 


1620 


TACCTACAGC 


GTGAGCATTG 


1680 


TATCCGGTAA 


GCGGCAGGGT 


1740 


GCCTGGTATC 


TTTATAGTCC 


1800 


TGATGCTCGT 


CAGGGGGGCG 


1860 


TTCCTGGCCT 


TTTGCTGGCC 


1920 


GTGGATAACC 


GTATTACCGC 


1980 


GAGCGCAGCG 


AGTCAGTGAG 


2040 


CCCGCGCGTT 


GGCCGATTCA 


2100 


CCTGCAAAAA 


ATAAATTCAT 


2160 


CTCTGGCGGT 


GTTGACATAA 


2220 



f 



# 



ATACCACTGG CGGTGATACT GAGCACATCA GCAGGACGCA 
GCTCTTAAAA ATTAAGCCCT GAAGAAGGGC AGCATTCAAA 
TGATACGAAA CGAAGCATTG GCCGTAAGTG CGATTCCGGA 
CGCGGGGGGT TTTCGTTCAG GACTACAACT GCCACACACC 
GAATCCAGAT GGATGCACAA ACACGCCGCC GCGAACGTCG 
GGAAAGCAGC AAATCCCCTG TTGGTTGGGG T AAGCG C AAA 
TTAACTATAA ACGCTGATGG AAGCGTTTAT GCGGAAGAGG 
AAAAAAACAA CAGCATAAAT AACCCCGCTC TTACACATTC 
AAATTAAACC ACACCTATGG TGTATGCATT TATTTG CAT A 
AAGGAAATAC TTACATATGT CTCGTTGTTC TCGTAAACCA 
GCTCGGCTGG GACGACTGGA TCATCGCGCC GCTGGACTAC 
CCTTTGCGAC TTCCCTTTGC GTTCGCACCT CGAGCCCACC 
GCTGCTCAAC TCCATGGCAC CAGACGCGGC GCCGGCCTCC 
CAGCCCCATC AGCATCCTCT ACATCGACGC CGCCAACAAC 
GGACATGGTG GTGGAGGCpT GCGGCTGCAG GTAGTCTAGA 
CAGGGTAGTA CAAATAAAAA AGGCACGTCA GATGACGTGC 
AGCTTGGCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG 
CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC 
ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG 
TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT 
TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC 
CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA 
GCTGCATGTG TCAG&GGTTT TCACCGTCAT CACCGAAACG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mVl 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 28 . . 243 




CTGACCACCA 


TGAAGGTGAC 


2280 


GCAGAAGGCT 


TTGGGGTGTG 


2340 


TTAGCTGCCA 


ATGTGCCAAT 


2400 


ACCAAAGCTA 


ACTGACAGGA 


2460 


CGCAGAGAAA 


CAGGCTCAAT 


2520 


ACCAGTTCCG 


AAAGATTTTT 


2580 


TAAAGCCCTT 


CCCGAGTAAC 


2640 


CAGCCCTGAA 


AAAGGGCATC 


2700 


CATTCAATCA 


ATTGTTATCT 


2760 


CTGCATGTAG 


ATTTTAAAGA 


2820 


GAGGCGTACC 


ACTGCGAGGG 


2880 


AACCATGCCA 


TCATTCAGAC 


2940 


TGCTGTGTGC 


CAGCGCGCCT 


3000 


GTTGTCTACA 


AGCAATACGA 


3060 


GTCGACCTGC 


AGTAATCGTA 


3120 


CTTTTTTCTT 


GTGAGCAGTA 


3180 


MAAALLL 1 \jVs 


LbTTACCCM 


3 24 0 


GTAATAGCGA 


AGAGGCCCGC 


3300 


AATGGCGCCT 


GATGCGGTAT 


3360 


ATGGTGCACT 


CTCAGTACAA 


3420 


GCCAACACCC 


GCTGACGCGC 


3480 


AGCTGTGACC 


GTCTCCGGGA 


3 54 0 


CGCGA 




3 505 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATC ATC GCG CCA TTA GAC TAC 51 

Trp lie lie Ala Pro Leu Asp Tyr 
1 5 

GAG GCA TAC CAC TGC GAG GGC GTT TGC GAC TTT CCT CTG CGC TCG CAC 99 
Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His 
10 15 20 

CTG GAG CCT ACC AAC CAC GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG 14 7 

Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu Leu Asn Ser Met 
25 30- 35 40 

GCG CCC GAC GCT GCG CCA GCC TCC TGC TGC GTG CCC GCA AGG CTC AGT 195 
Ala Pro Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser 
45 50 55 

CCC ATC AGC ATT CTC TAC ATC GAT GCC GCC AAC AAC GTG GTC TAC AAG 243 
Pro He Ser He Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys 
60 €5 70 

CAATACGAGG ACATGGTGGT GGGGAATTC 2 12 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val 
1 5 10 15 

Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He 
20 25 30 

He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser 
35 40 45 

t 

Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser lie Leu Tyr lie Asd 
50 55 60 

Ala Ala Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mV2 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 28.. 243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATT ATC GCG CCC CTA GAG TAC 51 

Trp ll'e He Ala Pro Leu Glu Tyr 
1 5 

GAG GCC TAT CAC TGC GAG GGC GTG TGC GAC TTT CCG CTG CGC TCG CAC 99 
Glu Ala Tyr His Cys Glu-Gly Val Cys Asp Phe Pro Leu Arg Ser His 
10 15 20 

CTT GAG CCC ACT AAC CAT GCC ATC ATT CAG ACG CTG ATG AAC TCC ATG 14 7 

Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu Met Asn Ser Met 
25 30 * 35 40 

GAC CCG GGC TCC ACC CCG CCT AGC TGC TGC GTT CCC ACC AAA CTG ACT 195 
Asp Pro Gly Ser Thr Pro Pro Ser Cys Cys Val Pro Thr Lys Leu Thr 
45 50 55 

CCC ATT AGC ATC CTG TAC ATC GAC GCG GGC AAT AAT GTA GTC TAC AAG 243 
Pro He Ser He Leu Tyr He Asp Ala Gly Asn Asn Val Val Tyr Lys 
60 65 70 

CAATACGAGG ACATGGTGGT GGGGAATTC 272 

>-*•* 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Trp He He Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu Gly Val 
1 5 10 15 

Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He K 
20 25 30 

He Gin Thr Leu Met Asn Ser Met Asp Pro Gly Ser Thr Pro Pro Ser 
35 40 45 

Cys Cys Val Pro Thr Lys Leu Thr Pro He Ser He Leu Tyr He Asp 
50 * 55 60 

Ala Gly Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mV9 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 28.. 243 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATC ATC GCA CCT CTT GAG TAT 51 

Trp lie He Ala Pro Leu Glu Tyr 
1 5 

GAG GCC TTC CAC TGC GAA GGA CTG TGT GAG TTC CCC TTG CGC TCC CAC 9 9 

Glu Ala Phe His Cys Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His 
10 15- 20 

TTG GAG CCC ACA AAC CAC GCA GTC ATT CAG ACC CTA ATG AAC TCT ATG 14 7 

Leu Glu Pro Thr Asn His Ala Val He Gin Thr Leu Met Asn Ser Met 
25 30 35 40 

GAC CCT GAA TCC ACA CCA CCC ACT TGT TGT GTG CCT ACA CGG CTG AGT 195 
Asp Pro Glu Ser Thr Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser 
45 50 55 

CCT ATT AGC ATC CTC^TTC ATC GAC TCT GCC AAC AAC GTG GTG TAT AAA 24 3 

Pro He Ser He Leu Phe He Asp Ser Ala Asn Asn Val Val Tyr Lys 
60 65 70 

CAATACGAGG ACATGGTGGT GGGGAATTC 2 72 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Trp He He Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu 
15 10 15 

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val 
20 2 5 3 0 

He Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr 
35 40 45 

Cys Cys Val Pro Thr Arg Leu Ser Pro He Ser He Leu Phe He Asp 
50 55 60 

Ser Ala Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: BMP/TGF-beta consensus sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Trp Xaa Asp Trp He Xaa Ala 

1 5 ■" 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 v base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #1 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17: 
CGGATCCTGG VANGAYTGGA THRTNGC 27 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: BMP/TGF-beta consensus sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

His Ala He Xaa Gin Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

{vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide 82 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTCTAGAAR NGTYTGNACD ATNGCRTG 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CCACTGCGAG GGCCTTTGCG ACTTCCCTTT GCGTTCGCAC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANpEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide #4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 
TGCGGATCCA GCCGCTGCAG CCGCAAGCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GACTCTAGAC TACCTGCAGC CGCAGGCCT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide #6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCGGATCCAA GGAGCTCGGC TGGGACGA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGAATTCCCC ACCACCATGT CCTCGTAT 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Human VL-1 protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 964 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 6 05.. 964 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

G AAT TCG GAT CTC TCG CAC ACT CCT CTC CGG AGA CAG AAG TAT TTG 
Asn Ser Asp Leu Ser His Thr Pro Leu Arg Arg Gin Lys Tyr Leu 
-201-200 -195 -190 

TTT GAT GTG TCC ATG CTC TCA GAC AAA GAA GAG CTG GTG GGC GCG GAG 
Phe Asp Val Ser Met Leu Ser Asp Lys Glu Glu Leu Val Gly Ala Glu 
-185 -180 -175 
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CTG CGG CTC TTT CGC CAG GCG CCC TCA GCG CCC TGG GGG CCA CCA GCC 142 
Leu Arg Leu Phe Arg Gin Ala Pro Ser Ala Pro Trp Gly Pro Pro Ala 
-170 -165 -160 -155 

GGG CCG CTC CAC GTG CAG CTC TTC CCT TGC CTT TCG CCC CTA CTG CTG 190 
Gly Pro Leu His Val Gin Leu Phe Pro Cys Leu Ser Pro Leu Leu Leu 
J -150 -145 -140 

GAC GCG CGG ACC CTG GAC CCG CAG GGG " GCG CCG CCG GCC GGC TGG GAA 238 
Asp Ala Arg Thr Leu Asp Pro Gin Gly Ala Pro Pro Ala Gly Trp Glu 
-135 -130 -125 

GTC TTC GAC GTG TGG CAG" GGC CTG CGC CAC CAG CCC TGG AAG CAG CTG 2 86 

Val Phe Asp Val Trp Gin Gly Leu Arg His Gin Pro Trp Lys Gin Leu 
-120 -115 -110 

TGC TTG GAG CTG CGG GCC GCA TGG GGC GAG CTG GAC GCC GGG GAG GCC 334 
Cys Leu Glu Leu Arg Ala Ala Trp Gly Glu Leu Asp Ala Gly Glu Ala 
-105 " -100 -95 

GAG GCG CGC GCG CGG GGA CCC CAG CAA CCG CCG CCC CCG GAC CTG CGG 3 82 

Glu Ala Arg Ala Arg Gly Pro Gin Gin Pro Pro Pro Pro Asp Leu Arg 
-90 "* -85 -80 -75 

AGT CTG GGC TTC GGC CGG AGG GTG CGG CCT CCC CAG GAG CGG GCC CTG 430 
Ser Leu Gly Phe Gly Arg Arg Val Arg Pro Pro Gin Glu Arg Ala Leu 
-70 -65 -60 

CTG GTG GTA TTC ACQ^AGA TCC CAG CGC AAG AAC CTG TTC GCA GAG ATG 478 
Leu Val Val Phe Thr Arg Ser Gin Arg Lys Asn Leu Phe Ala Glu Met 
-55 -50 -45 

CGC GAG CAG CTG GGC TCG GCC GAG GCT GCG GGC CCG GGC GCG GGC GCC 526 
Arq Glu Gin Leu Gly Ser Ala Glu Ala Ala Gly Pro Gly Ala Gly Ala 
-40 -35 -30 

GAG GGG TCG TGG CCG CCG CCG TCG GGC GCC CCG GAT GCC AGG CCT TGG 574 
Glu Gly Ser Trp Pro Pro Pro Ser Gly Ala Pro Asp Ala Arg Pro Trp 
-25 -20 -15 

CTG CCC TCG CCC GGC CGC CGG CGG CGG CGC ACG GCC TTC GCC AGT CGC 622 
Leu Pro Ser Pro Gly Arg Arg Arg Arg Arg Thr Ala Phe Ala Ser Arg 
-10 -5 15 

CAT GGC AAG CGG CAC GGC AAG AAG TCC AGG CTA CGC TGC AGC AAG AAG 610 
His Gly Lys Arg His Gly Lys Lys Ser Arg Leu Arg Cys Ser Lys Lys 
10 15 20 

CCC CTG CAC GTG AAC TTC AAG GAG CTG GGC TGG GAC GAC TGG ATT ATC 718 
Pro Leu His Val Asn Phe Lys Glu Leu Gly Trp Asp Asp Trp He He 
25 30 35 

GCG CCC CTG GAG TAC GAG GCC TAT CAC TGC GAG GGT GTA TGC GAC TTC 766 
Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe 
4 0 * 4 5 50 

CCG CTG CGC TCG CAC CTG GAG CCC ACC AAC CAC GCC ATC ATC CAG ACG 814 
Pro Leu Arq Ser His Leu Glu Pro Thr Asn His Ala He lie Gin Thr 
55 60 65 70 

CTG ATG AAC TCC ATG GAC CCC GGC TCC ACC CCG CCC AGC TGC TGC GTG 86 2 

Leu Met Asn Ser Met Asp Pro Gly Ser Thr Pro Pro Ser Cys Cys Val 
75 80 85 

CCC ACC AAA TTG ACT CCC ATC AGC ATT CTA TAC ATC GAC GCG GGC AAT 910 
Pre ^hr Lys Leu Thr Pro He Ser He Leu Tyr He Asp Ala Gly Asn 
90 95 130 
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AAT GTG GTC TAC AAG CAG TAC GAG GAC ATG GTG GTG GAG TCG TGC GGC 958 
Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly 
105 " HO 115 

TGC AGG TAGCGGTGCC TTTCCCGCCG CCTTGGCCCG GAACCAAGGT GGGCCAAGGT 1014 
Cys Arg 
120 

CCGCCTTGCA GGGGAGGCCT GGCTGCAGAG AGGCGGAGGA GGAAGCTGGC GCTGGGGGAG 1074 

GCTGAGGGTG AGGGAACAGC CTGGATGTGA GAGCCGGTGG GAGAGAAGGG AGCGCACCTT 1134 

CCCAGTAACT TCTACCTGCC AGCCCAGAGG GAAATAT 1171 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asn Ser Asp Leu Ser His Thr Pro Leu Arg Arg Gin Lys Tyr Leu Phe 
-201 -200 "195 -190 

Asp Val Ser Met LeiTser Asp Lys Glu Glu Leu Val Gly Ala Glu Leu 
-185 -180 -175 -170 

Arg Leu Phe Arg Gin Ala Pro Ser Ala Pro Trp Gly Pro Pro Ala Gly 
-165 -160 -155 

Pro Leu His Val Gin Leu Phe Pro Cys Leu Ser Pro Leu Leu Leu Asp 
-150 -145 -140 

Ala Arg Thr Leu Asp Pro Gin Gly Ala Pro Pro Ala Gly Trp Glu Val 
-135 -130 -125 

Phe Asp Val Trp Gin Gly Leu Arg His Gin Pro Trp Lys Gin Leu Cys 
-120 -115 "HO 

Leu Glu Leu Arg Ala Ala Trp Gly Glu Leu Asp Ala Gly Glu Ala Glu 
-105 -100 -95 -90 

Ala Arg Ala Arg Gly Pro Gin Gin Pro Pro Pro Pro Asp Leu Arg Ser 
-85 -80 -75 

Leu Gly Phe Gly Arg Arg Val Arg Pro Pro Gin Glu Arg Ala Leu Leu 
-70 -65 -60 

Val Val Phe Thr Arg Ser Gin Arg Lys Asn Leu Phe Ala Glu Met Arg 
-55 -50 -45 

Glu Gin Leu Gly Ser Ala Glu Ala Ala Gly Pro Gly Ala Gly Ala Glu 
-40 -35 -30 

Gly Ser Trp Pro Pro Pro Ser Gly Ala Pro Asp Ala Arg Pro Trp Leu 
-25 -20 -15 -10 

Pro Ser Pro Gly Arg Arg Arg Arg Arg Thr Ala Phe Ala Ser Arg His 

-5 1*5 

Gly Lys Arg His Gly Lys Lys Ser Arg Leu Arg Cys Ser Lys Lys Pro 
10 15 20 
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Leu His Val Asn Phe Lys Glu Leu Gly Trp Asp Asp Trp lie lie Ala 
25 30 35 

Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
40 45 50 55 

Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu 
60 65 70 

Met Asn Ser Met Asp Pro Gly Ser Thr Pro Pro Ser Cys Cys Val Pro 
75 80 85 

Thr Lys Leu Thr Pro He- Ser He Leu Tyr He Asp Ala Gly Asn Asn 
90 95 100 

Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly Cys 
105 110 115 

Arg 
120 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE': DNA encoding BMP 2 propeptide /BMP- 12 mature 
peptide 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .1233 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 847.. 1233 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC 4 8 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
-282 -280 -275 -270 

CTC CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG 96 
Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
-265 " -260 -255 

TTC GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG 14 4 

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 
-250 -245 -240 -235 

GTC CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA 192 
Va" Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
-230 "225 -220 

C-G AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA 24 0 

G V- Arg Pro T hr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
.215 -210 -205 
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GAC CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC 288 
Asp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 
-200 -195 -190 

CAC CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC 336 
His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
-185 -180 -175 

CAC CAT GAA GAA TCT TTG GAA GAA CTA -CCA GAA ACG AGT GGG AAA ACA 384 
His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
-170 -165 -160 -155 

ACC CGG AGA TTC TTC TTT -AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT 4 32 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser He Pro Thr Glu Glu Phe 
-150 -145 -140 

ATC ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT 4 80 

He Thr Ser Ala Glu Leu Qln Val Phe Arg Glu Gin Met Gin Asp Ala 
-135 -130 -125 

TTA GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC 528 
Leu Gly Asn Asn Ser Ser Phe His His Arg He Asn He Tyr Glu He 
-120 -115 -110 

ATA AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG 576 
He Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
-105 -100 -95 

GAC ACC AGG TTG GTG A£T CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT 624 
Asp Thr Arg Leu Val'Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp 
-90 -85 -80 -75 

GTC ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT 672 
Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
-70 -65 -60 

GGA TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC 72 0 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
-55 -50 -45 

AAG AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC 768 
Lys Arg His Val Arg He Ser Arg Ser Leu His Gin Asp Glu His Ser 
-40 -35 -30 

TGG TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA 8T6 
Tro Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
-25 -20 -15 

GGG CAT CCT CTC CAC AAA AGA GAA AAA CGT ACG GCG TTG GCC GGG ACG 864 
Gly His Pro Leu His Lys Arg Glu Lys Arg Thr Ala Leu Ala Gly Thr 
-10 -5 15 

CGG ACA GCG CAG GGC AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC 912 
Arc Thr Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg 
10 15 2 0 

AGG GGC CGG AGC CGC TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG 56 0 

Ara Gly Arg Ser Arg Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys 
25 " 30 35 

GAG CTC GGC TGG GAC GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG 1008 
Glu Leu Gly Trp Asp Asp Trp lie He Ala Pro Leu Asp Tyr Glu Ala 
40 45 50 

TAC CAC TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG 10 56 

Tv v His Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu 
"== ' 60 65 70 
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CCC ACC AAC CAT GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA 1104 
Pro Thr Asn His Ala He He Gin Thr Leu. Leu Asn Ser Met Ala Pro 
75 80 85 

GAC GCG GCG CCG GCC TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC 1152 
Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He 
90 95 100 

AGC ATC CTC TAC ATC GAC GCC GCC AAC - AAC < GTT GTC TAC AAG CAA TAC 1200 
Ser He Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr 
105 110 115 

GAG GAC ATG GTG GTG GAG -GCC TGC GGC TGC AGG 1233 
Glu Asp Met Val Val Glu Ala Cys Gly Cys Arg 
120 125 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
-282 -280 -275 -270 

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys v 
-265 -260 -255 

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 
-250 -245 -240 -235 

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
-230 -225 -220 

Gin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
-215 -210 -205 

Asp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 
-200 ~ -195 -190 

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
-185 ~ -180 -175 

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
-170 -165 -160 -155 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser He Pro Thr Glu Glu Phe 
-150 -145 -140 

He Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala 
-135 -130 -125 

Leu Gly Asn Asn Ser Ser Phe His His Arg He Asn He Tyr Glu lie 
-120 -115 -110 

He Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
-105 -100 -95 

Asp Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp 
-90 -85 -80 -75 



68 




Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
-70 -65 -60 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
-55 -50 -45 

Lys Arg His Val Arg He Ser Arg Ser Leu His Gin Asp Glu His Ser 
-40 -35 -30 

Trp Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
-25 -20 -15 

Gly His Pro Leu His Lys Arg Glu Lys Arg Thr Ala Leu Ala Gly Thr 
-10 -5 15 

Arg Thr Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg 
10 15 20 

Arg Gly Arg Ser Arg Cys "ser Arg Lys Pro Leu His Val Asp Phe Lys 
25 " 30 35 

Glu Leu Gly Trp Asp Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala 
4 0 45 50 

Tyr His Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu 
55 60 65 70 

Pro Thr Asn His Ala He He Gin Thr Leu Leu Asn Ser Met. Ala Pro 
75^. 80 85 

Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He 
90 95 100 

Ser He Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr 
105 " 110 115 

Glu Asp Met Val Val Glu Ala Cys Gly Cys Arg 
120 125 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: murine MV1 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2. .721 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

A AAG TTC TGC CTG GTG CTG GNG NCG GTG ACG GCC TCG GAG AGC AGN 4 6 

Lys Phe Cys Leu Val Leu X01 X02 Val Thr Ala Ser Glu Ser X03 
15 10 15 

CNG CTG GCC CTG AGA CGA CTG GGC TTC GGC TGN CCG GGC GGT GGC GAC 94 
X04 Leu Ala Leu Arg Arg Leu Gly Phe Gly X05 Pro Gly Gly Gly Asp 
20 25 30 
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GGC GGC GGC ACT GCG GNC GAG GAG CGC GCG CTG TTG GTG ATC TCC TCC 142 
Gly Gly Gly Thr Ala X06 Glu Glu Arg Ala Leu Leu Val lie Ser Ser 
35 40 45 

CGT ACG CAA AGG AAA GAG AGT CTG TTC CGG GAG ATC CGA GCC CAG GCC 190 
Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu He Arg Ala Gin Ala 
50 ^ 55 60 

CGT GCT CTC CGG GCC GCT GCA GAG CCG CCA CCG GAT CCA GGA CCA GGC 238 
Arg Ala Leu Arg Ala Ala Ala Glu Pro Pro Pro Asp Pro Gly Pro Gly 
65 70 75 

GCT GGG TCA CGC AAA GCC- -AAC CTG GGC GGT CGC AGG CGG CAG CGG ACT 286 
Ala Gly Ser Arg Lys Ala Asn Leu Gly Gly Arg Arg Arg Gin Arg Thr 
80 85 90 95 

GCG CTG GCT GGG ACT CGG GGA GNG NAG GGA AGC GGT GGT GGC GGC GGT 334 
Ala Leu Ala Gly Thr Arg Qiy X07 X08 Gly Ser Gly Gly Gly Gly Gly 
100 105 110 

GGC GGT GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GCA 382 
Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ala 
115 120 125 

GGC AGG GGC CAC GGG CGC AGA GGC CGG AGC CGC TGC GGT CGC AAG TCA 430 
Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg Cys Gly Arg Lys Ser 
130 135 140 

CTG CAC GTG GAC TTT AAG GAG CTG GGC TGG GAC GAC TGG ATC ATC GCG 4 78 

Leu His Val Asp Phe^Lys Glu Leu Gly Trp Asp Asp Trp He He Ala 
145 150 155 

CCA TTA GAC TAC GAG GCA TAC CAC TGC GAG GGC GTT TGC GAC TTT CCT 526 
Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
160 165 170 175 

CTG CGC TCG CAC CTG GAG CCT ACC AAC CAC GCC ATC ATT CAG ACG CTG 574 
Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu 
- 180 185 190 

CTC AAC TCC ATG GCG CCC GAC GCT GCG CCA GCC TCC TGC TGC GTG CCC 6 22 

Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser Cys Cys Val Pro 
195 200 205 

GCA AGG CTC AGT CCC ATC AGC ATT CTC TAC ATC GAT GCC GCC AAC AAC 6 70 

Ala Arg Leu Ser Pro He Ser He Leu Tyr He Asp Ala Ala Asn Asn 
210 215 220 

GTG GTC TAC AAG CAG TAC GAA GAC ATG GTG GTG GAG GCC TGC GGC TGC 718 
Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ala Cys Gly Cys 
225 ^ 230 235 



AGG TAGCATGCGG TCTGGGGAGG GTCTGGCCGC CCAGGACCCT AGCTCAAGAG 771 

Arg 

240 



CAGGTGTCAT 


CAGGCCCGAG 


GGACGGCGGA 


CTATGGCCTC 


TGCCAGCACA 


G AGG AG AGC A 


831 


CACAGTTAAC 


ACTCACATTT 


ACACACTCCT 


TCACTCACGC 


ACATGTTTAC 


CGTGGACGGC 


891 


AGGCGCTAAA 


AGCCTTGCTT 


ATTTGCT AC C 


ATTGATACAA 


ACCTCTGTCC 


TTTTCGGGAG 


951 


AGGGAAGGGC 


ATCTGTGTTT 


ATGTTGCAGT 


AATTGGCACT 


AAATCCAAGT 


AGAAATGGGT 


1011 


TAGCATTGGA 


TTCTCCTTTT 


AGTTGGAGGC 


GGTGTGGCTG 


GATTCCTGAC 


GTTGGATATG 


1071 


GAGTGCAC7G 


CAGGGCTGGG 


AT AC C C AG AT 


TCTCTGGAGT 


GGGCATTGGG 


AACCTTCAAA 


1131 
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# 



AGTAAGGAGC CACTGGGGCT TGGGAGGGAG CACCCGGTTC CTAAACAAGT CTGATGTGTA 1191 
CTGCTCAGTT TG 1203 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Phe Cys Leu Val Leu X01 X02 Val Thr Ala Ser Glu Ser X03 X04 

1 * 5 10 15 

>»>* 

Leu Ala Leu Arg Arg Leu Gly Phe Gly X05 Pro Gly Gly Gly Asp Gly 
20 25 30 

Gly Gly Thr Ala X06 Glu Glu Arg Ala Leu Leu Val lie Ser Ser Arg 
35 40 45 

Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie Arg Ala Gin Ala Arg . 
50 55 60 

Ala Leu Arg Ala Ala Ala Glu Pro Pro Pro Asp Pro Gly Pro Gly Ala 
65 ^10 75 80 

Gly Ser Arg Lys Ala Asn Leu Gly Gly Arg Arg Arg Gin Arg Thr Ala 
85 90 95 

Leu Ala Gly Thr Arg Gly X07 X08 Gly Ser Gly Gly Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ala Gly 
115 120 125 

Arg Gly His Gly Arg Arg Gly Arg Ser Arg Cys Gly Arg Lys Ser Leu 
130 135 140 

His Val Asp Phe Lys Glu Leu Gly Trp Asp Asp Trp He He Ala Pro 
145 150 155 160 

Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro Leu 
165 170 175 

Arg Ser His Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu Leu 
180 185 190 

Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala 
195 200 205 

Arg Leu Ser Pro He Ser He Leu Tyr He Asp Ala Ala Asn Asn Val 
210 215 220 

Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ala Cys Gly Cys Arg 
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1046 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) 


MOLECULE TYPE: DNA 


(genomic) 


(iii) 


HYPOTHETICAL: NO 




(ivj 


ANTI- SENSE: NO 




(vii) 


IMMEDIATE SOURCE: 
(B) CLONE: MURINE 


MV2 


(ix) 


FEATURE : 





(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 7 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

A AGA AAA CAA GCT TGC ATT CCT GCA GGT CCG ACT CTA AGA GGA TCC 4 6 

Ara Lys Gin Ala Cys He Pro Ala Gly Pro Thr Leu Arg Gly Ser 
1 5 10 15 

TCA GGG ACC CAA CCC AGG CCG GCT GGG AAG TCT TTC GAC GTG TGG CAG 94 
Ser Gly Thr Gin Pro Arg Pro Ala Gly Lys Ser Phe Asp Val Trp Gin 
20 25 30 

GGC CTG CGC CCT CAG CCT TGG AAG CAG CTG TGC CTG GAG TTG CGG GCA 142 
Glv Leu Arg Pro Gln^>ro Trp Lys Gin Leu Cys Leu Glu Leu Arg Ala 
35 40 45 

GCC TGG GGT GAG CTG GAC RCC GGG GAT ACG GGG GCG CGC GCG AGG GGT 190 
Ala Trp Gly Glu Leu Asp X01 Gly Asp Thr Gly Ala Arg Ala Arg Gly 
50 55 60 

CCC CAG CAG CCA CCG CCT CTG GAC CTG CGG AGT CTG GGC TTC GGT CGG 238 
Pro Gin Gin Pro Pro Pro Leu Asp Leu Arg Ser Leu Gly Phe Gly Arg 
65 70 75 

AGG GTG AGA CCG CCC CAG GAG CGC GCC CTG CTT GTA GTG TTC ACC AGA 286 
Ara Val Arg Pro Pro Gin Glu Arg Ala Leu Leu Val Val Phe Thr Arg 
80 85 90 95 

TCG CAG CGC AAG AAC CTG TTC ACT GAG ATG CAT GAG CAG CTG GGC TCT 334 
Ser Gin Arg Lys Asn Leu Phe Thr Glu Met His Glu Gin Leu Gly Ser 
100 105 110 

GCA GAG GCT GCG GGA GCC GAG GGG TCA TGT CCA GCG CCG TCG GGC TCC 382 
Ala Glu Ala Ala Gly Ala Glu Gly Ser Cys Pro Ala Pro Ser Gly Ser 
115 120 125 

CCA GAC ACC GGG TCT TGG CTG CCC TCG CCC GGC CGC CGG CGG CGA CGC 430 
Pro Asp Thr Gly Ser Trp Leu Pro Ser Pro Gly Arg Arg Arg Arg Arg 
130 135 140 

ACC GCC TTC GCC AGC CGT CAC GGC AAG CGA CAT GGC AAG AAG TCC AGG 4 78 

Thr Ala Phe Ala Ser Arg His Gly Lys Arg His Gly Lys Lys Ser Arg 
145 150 155 

CTG CGC TGC AGC AGA AAG CCT CTG CAC GTG AAT TTT AAG GAG TTA GGC 526 
Leu Ara Cys Ser Arg Lys Pro Leu His Val Asn Phe Lys Glu Leu Gly 
160 " 165 l* 70 1/b 

TGG GAC GAC TGG ATT ATC GCG CCC CTA GAG TAC GAG GCC TAT CAC TGC 57 4 

Trp Aso Asp Trp lie lie Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys 
180 185 190 
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GAG GGC GTG TGC GAC TTT CCG CTG CGC TCG CAC CTT GAG CCC ACT AAC 622 
Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn 
195 200 205 

CAT GCC ATC ATT CAG ACG CTG ATG AAC TCC ATG GAC CCG GGC TCC ACC 670 
His Ala He He Gin Thr Leu Met Asn Ser Met Asp Pro Gly Ser Thr 
210 215 220 

CCG CCT AGC TGC TGC GTT CCC ACC AAA CTG ACT CCC ATT AGC ATC CTG 718 
Pro Pro Ser Cys Cys Val Pro Thr Lys Leu Thr Pro He Ser He Leu 
225 230 235 

TAC ATC GAC GCG GGC AAT AAT GTN GTC TAC AAG CAG TAT GAG GAC ATG 766 
Tyr He Asp Ala Gly Asn Asn X02 Val Tyr Lys Gin Tyr Glu Asp Met 
240 *" 245 250 255 

GTG GTG GAG TCC TGC GGC TGT AGG TAGCGGTGCT GTCCCGCCAC CTGGGCCAGG 82 0 
Val Val Glu Ser Cys Gly Cys Arg 
260 

GACCATGGAG GGAGGCCTGA CTGCCGAGAA AGGAGCAGGA GCTGGCCTTG GAAGAGGCCA 8 80 

CAGGTGGGGG ACAGCCTGAA AGTAGGAGCA CAGTAAGAAG CAGCCCAGCC TTCCCAGAAC 94 0 

CTTCCAATCC CCCAACCCAG AAGCAGCTAA GGGGTTTCAC AACTTTTGGC CTTGCCAGCC 1000 

TGGAAAGACT AGACAAGAGG GATTCTTCTC TTTTTATTAT GGCTTG 1046 

(2) INFORMATION FORr^SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Arq Lys Gin Ala Cys He Pro Ala Gly Pro Thr Leu Arg Gly Ser Ser 
15 10 15 

Gly Thr Gin Pro Arg Pro Ala Gly Lys Ser Phe Asp Val Trp Gin Gly 
20 ~ 25 30 

Leu Arg Pro Gin Pro Trp Lys Gin Leu Cys Leu Glu Leu Arg Ala Ala 
3 5 4 0 4 5 

Trp Gly Glu Leu Asp X01 Gly Asp Thr Gly Ala Arg Ala Arg Gly Pro 
50 SS 60 

Gin Gin Pro Pro Pro Leu Asp Leu Arg Ser Leu Gly Phe Gly Arg Arg 
65 70 75 80 

Val Arg Pro Pro Gin Glu Arg Ala Leu Leu Val Val Phe Thr Arg Ser 
85 " 90 95 

Gin Arg Lys Asn Leu Phe Thr Glu Met His Glu Gin Leu Gly Ser Ala 
100 105 HO 

Glu Ala Ala Gly Ala Glu Gly Ser Cys Pro Ala Pro Ser Gly Ser Pre 
U5 120 125 

Aso Thr Gly Ser Trp Leu Pro Ser Pro Gly Arc Arg Arg Arg Arg Thr 
" 130 135 140 
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Ala Phe Ala Ser Arg His Gly Lys Arg His Gly Lys Lys Ser Arg Leu 
145 150 155 160 

Arg Cys Ser Arg Lys Pro Leu His Val Asn Phe Lys Glu Leu Gly Trp 
165 170 175 

Asp Asp Trp He He Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu 
180 185 190 

Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His 
195 200 205 

Ala He He Gin Thr Leu ..Met Asn Ser Met Asp Pro Gly Ser Thr Pro 
210 215 220 

Pro Ser Cys Cys Val Pro Thr Lys Leu Thr Pro He Ser He Leu Tyr 
225 230 235 240 

He Asp Ala Gly Asn Asn Xt)2 Val Tyr Lys Gin Tyr Glu Asp Met Val 
245 250 255 

Val Glu Ser Cys Gly Cys Arg 
260 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANB15DNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: HUMAN Vl-1 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 138.. 1301 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 990.. 1301 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

AACTATAGCA CCTGCAGTCC CTGGTCTTGG GTGTAGGGGT GCGCTCCTGG TCCCGCGGCT 6 0 

C.-.GGGATATG CAGTGACCAA TGGGTTGTTG GCCTGATGGG ACTTTTGGCT TGCTAAACCA 12 0 

AAGCTCGGTT CGGATAG CCC GGG CGA AGA CGT CCG CTG CTC TGG GCC AGG 17 0 

Pro Gly Arg Arg Arg Pro Leu Leu Trp Ala Arg 
-284 -280 -275 

C73 GCA GCG TTC AGG CTG GGG CAG AGA CGC GGA GTC GGG CGC TGG CTC 218 
Leu Ala Ala Phe Arg Leu Gly Gin Arg Arg Gly Val Gly Arg Trp Leu 
-270 -265 -260 
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CAA CAG GCC TGG CTC CCA CAT CGA AGA CAG CTG GGC CAT TTG CTG TTA 266 
Gin Gin Ala Trp Leu Pro His Arg Arg Gin Leu Gly His Leu Leu Leu 
-255 -250 -245 

GGA GGC CCC GCG CTG ACA GTG TGC AGG ATT TGC TCT TAC ACA GCT CTT 314 
Gly Gly Pro Ala Leu Thr Val Cys Arg lie Cys Ser Tyr Thr Ala Leu 
-240 -235 -230 

TCT CTC TGT CCC TGC CGG TCC CCC GCA. .GAC. GAA TCG GCA GCC GAA ACA 362 
Ser Leu Cys Pro Cys Arg Ser Pro Ala Asp Glu Ser Ala Ala Glu Thr 
-225 -220 -215 -210 

GGC CAG AGC TTC CTG TTC. GAC GTG TCC AGC CTT AAC GAC GCA GAC GAG 410 
Gly Gin Ser Phe Leu Phe Asp Val Ser Ser Leu Asn Asp Ala Asp Glu 
-205 -200 -195 

GTG GTG GGT GCC GAG CTG CGC GTG CTG CGC CGG GGA TCT CCA GAG TCG 4 58 

Val Val Gly Ala Glu Leu Arg Val Leu Arg Arg Gly Ser Pro Glu Ser 
-190 v " -185 ~ -180 

GGC CCA GGC AGC TGG ACT TCT CCG CCG TTG CTG CTG CTG TCC ACG TGC 506 
Gly Pro Gly Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys 
-175 -170 -165 

CCG GGC GCC GCC CGA GCG CCA CGC CTG CTG TAC TCG CGG GCA GCT GAG 554 
Pro Gly Ala Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu 
-160 -155 -150 

CCC CTA GTC GGT CAG CGC TGG GAG GCG TTC GAC GTG GCG GAC GCC ATG ' 6 02 

Pro Leu Val Gly Gln-Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met 
-145 -140 -135 -130 

AGG CGC CAC CGT CGT GAA CCG CGC CCC CCC CGC GCG TTC TGC CTC TTG 6 50 

Arg Arg His Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu 
-125 -120 -115 

CTG CGC GCA GTG GCA GGC CCG GTG CCG AGC CCG TTG GCA CTG CGG CGA 6 98 

Leu Arg Ala Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg 
-110 -105 -100 

CTG GGC TTC GGC TGG CCG GGC GGA GGG GGC TCT GCG GCA GAG GAG CGC 74 6 

Leu Gly Phe Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg 
-95 -90 -85 

GCG GTG CTA GTC GTC TCC TCC CGC ACG CAG AGG AAA GAG AGC TTA TTC ISA 
Ala Val Leu Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe 
-80 -75 ~ -70 

CGG GAG ATC CGC GCC CAG GCC CGC GCG CTC GGG GCC GCT CTG GCC TCA 84 2 

Arg Glu lie Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser 
-65 -60 -55 -50 

GAG CCG CTG CCC GAC CCA GGA ACC GGC ACC GCG TCG CCA AGG GCA GTC 8 90 

Glu Pro Leu Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val 
-45 -40 -35 

ATT GGC GGC CGC AGA CGG AGG AGG ACG GCG TTG GCC GGG ACG CGG ACA 93 8 

lie Glv Gly Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr 
-30 -25 -20 

GCG CAG GGC AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC AGG GGC 9 86 

Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly 
-15 -10 -5 

CGG AGC CGC TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG GAG CTC 1C24 
Arc Ser Arg Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu 
1 " J 5 10 IS 
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GGC TGG GAC GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG TAC CAC 1082 
Gly Trp Asp Asp Trp lie He Ala Pro Leu Asp Tyr Glu Ala Tyr His 
20 25 30 

TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC 1130 
Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
35 40 45 

AAC CAT GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA GAC GCG 1178 
Asn His Ala He He Gin Thr Leu LeuAsn Ser Met Ala Pro Asp Ala 
50 55 60 

GCG CCG GCC TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC AGC ATC 1226 
Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser He 
65 70 75 

CTC TAC ATC GAC GCC GCC AAC AAC GTT GTC TAC AAG CAA TAC GAG GAC 12 74 

Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp 
80 85 V 90 95 

ATG GTG GTG GAG GCC TGC GGC TGC AGG TAGCGCGCGG GCCGGGGAGG 1321 
Met Val Val Glu Ala Cys Gly Cys Arg 
100 

GGGCAGCCAC GCGGCCGAGG ATCC 134 5 



(2) INFORMATION FOR SEQ ID NO:34: 

( i ) SEQUENCER-CHARACTERISTICS : 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Pro Gly Arg Arg Arg Pro Leu Leu Trp Ala Arg Leu Ala Ala Phe Arg 
-284 -280 -275 -270 

Leu Gly Gin Arg Arg Gly Val Gly Arg Trp Leu Gin Gin Ala Trp Leu 
-265 -260 -255 

Pro His Arg Arg Gin Leu Gly His Leu Leu Leu Gly Gly Pro Ala Leu 
-250 -245 -240 

Thr Val Cys Arg He Cys Ser Tyr Thr Ala Leu Ser Leu Cys Pro Cys 
-235 -230 -225 

Arg Ser Pro Ala Asp Glu Ser Ala Ala Glu Thr Gly Gin Ser Phe Leu 
-220 -215 -210 -205 

Phe Asp Val Ser Ser Leu Asn Asp Ala Asp Glu Val Val Gly Ala Glu 
-200 -195 -190 

Leu Arg Val Leu Arg Arg Gly Ser Pro Glu Ser Gly Pro Gly Ser Trp 
-185 -180 -175 

Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala Ala Arg 
-170 -165 -160 

Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val Gly Gin 
-155 " -150 " -145 

Arg Trz> Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His Arg Arg 
-140 " -135 -130 -125 



76 



Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala Val Ala 
-120 * -115 -110 

Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe Gly Trp 
-105 -100 -95 

Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu Val Val 
-90 -85 -80 



Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie Arg Ala 
-75 -70 -65 

Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu Pro Asp 
-60 " -55' -50 -45 

Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val He Gly Gly Arg Arg 
-40 -35 -30 

Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly Ser Gly 
-25 -20 -15 

Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg Cys Ser 
-10 -5 1 

Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp Asp Trp 
5 10 15 20 

He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Leu Cys 
25 " 30 35 

Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He He 
40 ~ 45 50 

Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser Cys 
55 60 65 

Cys Val Pro Ala Arg Leu Ser Pro He Ser He Leu Tyr He Asp Ala 
70 75 80 

Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ala 
85 90 95 100 

Cys Gly Cys Arg 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: primer number 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TC-7.-.TGCGAC TTCCCGC I 7 
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